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Abstract 

A contemporary perspective on the tradeoff between transmit antenna diversity and 
spatial multiplexing is provided. It is argued that, in the context of most modern wire- 
less systems and for the operating points of interest, transmission techniques that utilize 
all available spatial degrees of freedom for multiplexing outperform techniques that explic- 
itly sacrifice spatial multiplexing for diversity. In the context of such systems, therefore, 
there essentially is no decision to be made between transmit antenna diversity and spatial 
multiplexing in MIMO communication. Reaching this conclusion, however, requires that 
the channel and some key system features be adequately modeled and that suitable perfor- 
mance metrics be adopted; failure to do so may bring about starkly different conclusions. As 
a specific example, this contrast is illustrated using the 3GPP Long-Term Evolution system 
design. 



I Introduction 

Multipath fading is one of the most fundamental features of wireless channels. Because 
multiple received replicas of the transmitted signal sometimes combine destructively, 
there is a significant probability of severe fades. Without any means of mitigating such 
fading, ensuring reasonable reliability requires hefty power margins. 
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Fortunately, fades, or nulls, are very localized in space and frequency: a change in the 
transmitter or receiver location (on the order of a carrier wavelength) or in the frequency 
(on the order of the inverse of the propagation delay spread) leads to a roughly indepen- 
dent realization of the fading process. Motivated by this selectivity, the concept of diversity 
is borne: rather than making the success of a transmission entirely dependent on a single 
fading realization, hedge the transmission's success across multiple realizations in order 
to decrease the probability of failure. Hedging or diversifying are almost universal ac- 
tions in the presence of uncertainty, instrumental not only in communications but also in 
other fields as disparate as economics or biology. 

In communications specifically, the term 'diversity' has, over time, acquired different 
meanings, to the point of becoming overloaded. It is used to signify: 

• Variations of the underlying channel in time, frequency, space, etc. 

• Performance metrics related to the error probability. Adding nuance to the term, more 
than one such metric can be defined (cf. Section HVll . 

• Transmission and/ or reception techniques designed to improve the above metrics. 

In this paper, we carefully discriminate these meanings. We use 'selectivity' to refer to 
channel features, which are determined by the environment (e.g., propagation and user 
mobility) and by basic system parameters (e.g., bandwidth and antenna spacing). In turn, 
the term 'diversity' is reserved for performance metrics and for specific transmit/ receive 
techniques, both of which have to do with the signal. Note that channel selectivity is a 
necessary condition for diversity strategies to yield an improvement in some diversity 
metric. 



A Diversity over Time 

Archaic electrical communication systems from a century ago already featured primitive 
forms of diversity, where operators manually selected the receiver with the best quality. 
Automatic selection of the strongest among various receivers was discussed as early as 
1930 [IJ. This naturally led to the suggestion of receive antenna combining, initially for 
microwave links |I3-|[5l. MRC, by far the most ubiquitous combining scheme, was first 
proposed in 1954 ||6|]. In addition to receive antenna combining, other approaches such as 
the aforementioned one of repeating the signal on two or more frequency channels were 
also considered for microwave links [7J. (Systems were still analog and thus coding and 
interleaving was not an option.) Given the cost of spectrum, though, approaches that 
consume additional bandwidth were naturally unattractive and thus the use of antennas 
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quickly emerged as the preferred diversity approach. Recognizing this point, receive 
antenna combining was debated extensively in the 1950's |[8 | -|[TT | and has since been 
almost universally adopted for use at base station sites. The industry, however, remained 
largely ambivalent about multiple antennas at mobile devices. Although featured in early 
AMPS trials in the 1970's, and despite repeated favorable studies (e.g., 112J), until recently 
its adoption had been resisted^ 

Multiple base station antennas immediately allow for uplink receive diversity. It is less 
clear, on the other hand, how to achieve diversity in the downlink using only multiple 
transmit antennas. In Rayleigh fading, transmitting each symbol from every antenna 
simultaneously is equivalent to using a single transmit antenna fMl Section 7.3.2]. Sub- 
optimal schemes were formulated that convert the spatial selectivity across the transmit 
antennas into effective time or frequency selectivity. In these schemes, multiple copies of 
each symbol are transmitted from the various antennas, each subject to either a phase shift 
IITSi or a time delay fT6l. From the standpoint of the receiver, then, the effective channel 
that the signal has passed through displays enhanced time or frequency selectivity and 
thus a diversity advantage can be reaped with appropriate coding and interleaving (cf. 
Section I-C). 

More refined transmit diversity techniques did not develop until the 1990's. Pioneered 
in IIZI, these techniques blossomed into STBC (space-time block codes) |[T8| and, subse- 
quently, onto space-time codes at large. Albeit first proposed for single-antenna receivers, 
STBC's can also be used in MIMO (multiple-input multiple-output) communication, i.e., 
when both transmitter and receiver have a multiplicity of antennas. This yields additional 
diversity, and thus reliability, but no increases in the number of information symbols per 
MIMO symbol. 

Concurrently with space-time coding, the principles of spatial multiplexing were also for- 



mulated in the 1990's |[T9 | -|[22 | . The tenet in spatial multiplexing is to transmit different 
symbols from each antenna, and have the receiver discriminate these symbols by tak- 
ing advantage of the fact that each transmit antenna has a different spatial signature at 
the receiver (because of spatial selectivity). This does allow for an increased number of 
information symbols per MIMO symbol, but it does not enhance reliability. 

Altogether, the powerful thrust promised by MIMO is finally bringing multiantenna de- 
vices to the marketplace. Indeed, MIMO is an integral feature of emerging wireless sys- 
tems such as 3GPP LTE (Long-Term Evolution) ||23B , 3GPP2 Ultra Mobile Broadband, and 
IEEE 802.16 WiMAX II24I. 



^The sole exception was the Japanese PDC system fT3l , which supported dual-antenna terminals since 
the early 1990's. 
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B Overview of Work 

With the advent of MIMO, it may seem that a choice needs to be made between transmit 
diversity techniques, which increase rehabiHty (decrease probability of error), and spatial 
multiplexing techniques, which utilize antennas to transmit additional information but 
do not increase reliability. Applications requiring extremely high reliability seem well 
suited for transmit diversity techniques whereas applications that can smoothly handle 
loss appear better suited for spatial multiplexing. It may further appear that the SNR 
(signal-to-noise ratio) and the degree of channel selectivity should also affect this decision. 

Our findings, however, differ strikingly from the above intuitions. The main conclusion 
is that techniques utilizing all available spatial degrees of freedom for multiplexing out- 
perform, at operating points of interest for modern wireless systems, techniques that ex- 
plicitly sacrifice spatial multiplexing for transmit diversity Thus, from a performance 
perspective there essentially is no decision that need be made between transmit diver- 
sity and multiplexing in contemporary MIMO systems. This conclusion holds even when 
suboptimal spatial multiplexing techniques are used. 

There are a number of different arguments that lead to this conclusion, and which will be 
elaborated upon: 

• Modern systems use link adaptation to maintain a target error probability and there 
is essentially no benefit in operating below this target. This makes diversity metrics, 
which quantify the speed at which error probability is driven to zero with the signal- 
to-noise ratio, beside the point. 

• Wireless channels in modem systems generally exhibit a notable amount of time and 
frequency selectivity, which is naturally converted into diversity benefits through cod- 
ing and interleaving. This renders transmit antenna diversity unnecessary. 

• Block error probability is the relevant measure of reliability. Since the channel codes 
featured in contemporary systems allow for operation close to information-theoretic 
limits, such block error probability is well approximated by the mutual information 
outages. Although uncoded error probability is often quantified, this is only an indi- 
rect performance measure, and incorrect conclusions are sometimes reached by con- 
sidering only uncoded performance. 

It is also imperative to recognize that the notion of diversity is indelibly associated with 
channel uncertainty. If the transmitter has instantaneous CSI (channel-state information), 
then it can match its transmission to the channel in such a way that the error probability 
depends only on the noise. Diversity techniques, which aim precisely at mitigating the 
effects of channel uncertainty, are then beside the point. Although perhaps evident, this 
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point is often neglected. In some models traditionally used to evaluate diversity tech- 
niques, for instance, the channel fades very slowly yet there is no transmitter adaptation. 
As we shall see, these models do not reflect the operating conditions of most current sys- 
tems. 



C Time/Frequency Diversity v. Antenna Diversity 

Perhaps the simplest manifestation of the efficacy of diversity in the aforementioned tra- 
ditional settings is receive antenna combining: if two receive antennas are sufficiently 
spaced, the same signal is received over independently faded paths. Even with simple 
selection combining, this squares the probability of error; optimal MRC (maximum-ratio 
combining) performs even better. 

Based upon the specifics of receive antenna combining, it may appear that multiple, in- 
dependently faded copies of the same signal are required to mitigate fading. Although 
this is an accurate description of receive combining, it is an overly stringent requirement 
in general. This point is clearly illustrated if one considers a frequency-selective chan- 
nel. One simple but naive method of mitigating fading in such a channel is to repeat the 
same signal on two sufficiently spaced frequency channels. Unlike receive combining, 
this technique doubles the number of symbols transmitted and therefore the necessary 
bandwidth. Is repetition, which seems inefficient, the only way to take advantage of fre- 
quency diversity? It is not — if coding is taken into consideration. By applying a channel 
code to a sequence of information bits, the same benefit is gained by transmitting different 
portions of the coded block over different frequency channels. No repetition is necessary; 
rather, information bits are coded and interleaved, and then the first half of the coded 
block is transmitted on the first frequency and the other half on the second frequency. 
The information bits can be correctly decoded as long as both frequencies are not badly 
faded. The same principle applies to time selectivity: instead of repeating the same sig- 
nal at different time instants, transmit a coded and interleaved block over an appropriate 
time period [25J@ 

^When explaining the exploitation of selectivity through coding and interleaving, it is important to dis- 
pel the misconception that channel coding incurs a bandwidth penalty. If the constellation is kept fixed, 
then coding does reduce the rate relative to an uncoded system. However, there is no rate penalty if the 
constellation size is flexible as in modern systems. For instance, a system using QPSK with a rate-1/2 binary 
code and an uncoded BPSK system both have an information rate of 1 bit/ symbol. For a reasonably strong 
code, though, the coded system will achieve a considerably smaller bit error probability than the uncoded 
one. More importantly, the advantage of the coded system in terms of block error probability is even larger 
and this advantage increases with blocklength: the block error probability of a coded system decreases with 
the blocklength whereas, without coding, it actually increases with the blocklength. As will be emphasized 
throughout the paper, modern wireless systems cannot be conceived without powerful channel coding. 
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II Modeling Modem Wireless Systems 



Wireless systems have experienced dramatic changes as they evolved from their initial 
analog forms to today's advanced digital formats. Besides MIMO, features of modern 
systems — that in many cases were completely absent in earlier designs — include: 

• Wideband channelizations and OFDM. 

• Packet switching, complemented with time- and frequency-domain scheduling for 
low- velocity users. 

• Powerful channel codes ||26ll27il28l. 

• Link adaptation, specifically rate control via variable modulation and coding 1129 L 

• ARQ (automatic repeat request) and H-ARQ (hybrid- ARQ) [30J. 

These features have had a major impact on the operational conditions: 

• There is a target block error probability, on the order of 1%, at the output of the decoder. 
(When H-ARQ is in place, this target applies at termination.) Link adaptation loops 
are tasked with selecting the rate in order to maintain performance tightly around this 
operating point. The rationale for this is two-fold: 

i) There is little point in spending resources pushing the error probability on the traffic 
channels much below the error probability on the control plane, which, by its very 
nature (short messages and tight latency requirements), cannot be made arbitrarily 
small. 

ii) Lower error probabilities often do not improve end-to-end performance: in some 
applications (e.g., voice) there is simply no perceivable improvement in the user ex- 
perience while, in others (e.g., data communication requiring very high reliability), 
it is more cost effective to let the upper protocol layers handle the losses. 

• The fading of low- velocity users can be tracked and fed back to the transmitter thereby 
allowing for link adaptation to the supportable rate, scheduling on favorable time/ frequency 
locations, and possibly beamforming and precoding. 

• The channels of high-velocity users vary too quickly in time to allow for feedback of 
CSI or even of the supportable rate. Thus, the signals of such users are dispersed over 
the entire available bandwidth thereby taking advantage of extensive frequency selec- 
tivity. In addition, time selectivity is naturally available because of the high velocity. 
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The above points evidence the disparity between the low- and high-velocity regimes and 
hence, in order to organize the discussion, it is necessary to distinguish between them. 



Ill The Low- Velocity Regime 

At low velocities, timely feedback regarding the current state of the channel becomes 
feasible. This fundamentally changes the nature of the communication problem: all un- 
certainty is removed except for the noise. With powerful coding handling that remaining 
uncertainly, outages are essentially eliminated^ Transmit diversity techniques, whose 
goal is precisely to reduce outages, become inappropriate. Rate maximization becomes 
the overriding transmission design principle, and the optimum strategy in this known- 
channel setting is spatial waterfilling [1211 . 

Although the above consideration posited perfect CSI at the transmitter, it also extends to 
imperfect-CSI settings (caused by limited rate and/ or delay in the link adaptation loop). 
At a minimum, the supportable rate can be fed back; this still removes outages. Ad- 
ditional CSI feedback enables adaptive techniques such as scheduling, power control, 
beamforming and precoding [STlH 

In multiuser settings, furthermore, CSI feedback is collected from many users and time- 
and frequency-domain scheduling offers additional degrees of freedom. In this case, 
transmit diversity techniques can actually be detrimental because they harden the possi- 
ble transmission rates to different users thereby reducing potential multiuser scheduling 
gains mM- 

These conclusions apply almost universally to indoor systems, which conform to this 
low-velocity regime, as long as their medium-access control features the necessary func- 
tionalities. In outdoor systems, they apply to stationary and pedestrian users. 



IV The High- Velocity Regime 

Having established that diversity is not an appropriate perspective in the low-velocity 
regime, we henceforth focus exclusively on the high-velocity regime. This is the regime 

^The rate supported by the channel may be essentially zero at some time /frequency points, but with 
proper link adaptation this does not constitute an outage in the sense that no data is lost (cf. Eq. |4ll. 

^Feedback mechanisms are sometimes studied under the assumption that they convey information re- 
garding the transmit strategy, e.g., which beamformer or precoder to use, but not regarding rate selection, 
in which case outages still occur. This, however, is not well aligned with modern system designs in which 
rate control is paramount. 
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of interest for vehicular users in outdoor systems. At high velocities, the fading (and 
therefore the time-varying mutual information) is too rapid to be tracked. The link adap- 
tation loops can therefore only match the rate to the average channel conditions. The 
scheduler, likewise, can only respond to average conditions and thus it is not possible to 
transmit only to users with favorable instantaneous channels; we thus need not distin- 
guish between single-user and multiuser settings. 



A Channel Model and Performance Metrics 

Let and denote, respectively, the number of transmit and receive antennas. Assum- 
ing that OFDM (orthogonal frequency division multiplexing), the prevalent signalling 
technique in contemporary systems, is used to decompose a possibly frequency-selective 
channel into parallel, non-interfering tones, the received signal on the ith tone is 

Yi = HiXi + Hi (1) 

where Hj is the x channel matrix on that tone, is the n^xl received signal, Hj is the 
TiR X 1 thermal noise, IID circularly symmetric complex Gaussian with unit variance, and 
Xj is the TT-T X 1 transmitted signal subject to a power constraint snr, i.e., i?[||xj|p] < snr. 
The receiver has perfect knowledge of the N channel matrices (the joint distribution of 
which is specified later). 

For a particular realization of Hi, . . . , H^v, the average mutual information thereon is 

1 ^ 

X(SNR) = -5^/(x,;y,). (2) 

i=l 

This quantity is in bits per (complex) modulation symbol, and thus it represents spectral 
efficiency in bits/s/Hz under the standard assumption of one symbol/ s/Hz. The mutual 
information on each tone is determined by the chosen signal distribution. If the signals 
are IID complex GaussiarB with _E'[x,;x|] = ^^I, then 

/(x,; y,) = log2 det (l + ^ H,h1) • (3) 

Since approaching this mutual information may entail high complexity, simpler MIMO 
strategies with different (lower) mutual informations are often used. Expressions for 
these are given later in this Section. 

^Actual systems use discrete constellations, for which counterparts to Q exist in integral form [34J. As 
long as the cardinality of the constellation is large enough relative to the SNR, the gap between the actual 
mutual information and ^ is small and inconsequential to our conclusions. 
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Once a transmission strategy has been specified, the corresponding outage probability for 
rate R (bits /s /Hz) is then 

Pout(SNR, R) = Pr{J(SNR) < R}. (4) 

With suitably powerful codes, the error probability when not in outage is very small and 
the outage probability is an accurate approximation for the actual block error probability 
||35| - ||37| . We shall therefore use both notions interchangeably henceforth. 

As justified in Section [III modem systems operate at a target error probability Hence, the 
primary performance metric is the maximum rate, at each snr, such that this target is not 
exceeded, i.e., 

Re{sm) = max{C : Pout(sNR, () < e} (5) 

where e is the target. 



B The Diversity-Multiplexing Tradeoff 

A clear tradeoff exists between rate, outage and snr. Traditionally, notions of diversity 
order study the speed at which error probability decreases (polynomially) as snr is taken 
to infinity while R is kept fixed as in (jU). Although meaningful in early wireless systems, 
where R was indeed fixed, this is not particularly indicative of contemporary systems in 
which R is increased with snr. 

An alternative formulation was introduced in [38l], where R increases with snr according 
to some function R = /(snr). The multiplexing gain is defined as 

r= lim (6) 

SNR^oo log SNR 

which is the asymptotic slope of the rate-SNR curve in bits/s/Hz/ (3 dB), while the diver- 
sity order is defined as 

rf=- lim 10S^O^;(SNR,/(SNR))_ 

SNR^oo log SNR 

Given a number of transmit and receive antennas, diversity and multiplexing are con- 
flicting objectives as succinctly captured by the DMT (diversity-multiplexing tradeoff) 
||38| . Formulated for a quasi-static channel model where each coded block is subject to a 
single realization of the fading process, the DMT specifies that, with transmit and 
receive antennas, min(nT, n^) + 1 distinct DMT points are feasible, each corresponding to 
a multiplexing gain < r < min(nT, n^) and a diversity order 

(i(r) = (riT - r)(nR - r). (8) 
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Stated simply, if the rate is increased with snr as r logSNR then the outage can decrease no 
faster than sNR"("T-f )(nH-i") This is the optimum DMT; then, each specific transmit-receive 
architecture is associated with a DMT that may or may not achieve this optimum. 

Note that, in dS]) and throughout the paper, d quantifies only the antenna diversity order as 
per the asymptotic definition in 0. If the coded block spans several fading realizations, 
then this additional time /frequency selectivity leads to larger diversity orders but does 
not increase the maximum value of r [|38ll39| . 

A multiplexing gain r = signifies a rate that does not increase (polynomially) with the 
SNR while d = indicates an outage probability that does not decrease (polynomially) 
with the SNR. 

Although the DMT is a powerful tool, it has clear limitations that stem from the fact that 
the diversity order and the multiplexing gain are only proxies for performance measures 
of real interest (error probability and rate, respectively). The asymptotic nature of the 
definitions of r and d naturally restricts the validity of the DMT insights to the high-power 
regime!^ Even in that regime, the diversity order does not suffice to determine the error 
probability at a given snr. It simply quantifies the speed at which the error probability 
falls with the snr. Similarly, the multiplexing gain does not suffice to determine the rate, 
but it only quantifies how the rate grows with the snr. 

The quantity of interest R^isNR) introduced in ^ corresponds to the d = DMT point. 
From the DMT, all we can infer about it is the value of the asymptotic slope 

lim (9) 

SNR^oo log SNR 

which can, at most, equal min(nT, n^^). Certain architectures achieve this maximum, while 
others fall short of it. The traditional notion of diversity, in turn, provides no information 
about Re because it is defined for some fixed rate. 



C Diversity v. Multiplexing in Modern Systems 

In this high-velocity scenario, frequency-flat analyses are likely to indicate that dramatic 
reductions in outage probability can be had by increasing d. On these grounds, transmis- 
sion strategies that operate efficiently at the full-diversity DMT point have been devel- 
oped. The value of these strategies for modern wireless systems, however, is questionable 
because: 

^Any feature whose effect is non-polynomial in the SNR is immaterial in terms of the DMT. Non- 
asymptotic DMT formulations, valid for arbitrary SNR, have been put forth but they lack the simplicity 
and generality of (|8l I40i,i41il. 
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Table 1: MIMO-OFDM System Parameters 



Tone spacing 


15 kHz 


OFDM Symbol duration 


71.5 fis 


Bandwidth 


10 MHz (600 tones, excluding guards) 


Resource block 


12 tones over 1 ms (168 symbols) 


H-ARQ 


Incremental redundancy 


H-ARQ round spacing 


6 ms 


Max. number H-ARQ rounds 


6 


Power delay profile 


12-ray TU 


Doppler spectrum 


Clarke-Jakes 


Max. Doppler frequency 


185 Hz 


Antenna correlation 


None 



1. The outage need not be reduced below the target error probability. 

2. Diversity is plentiful already: 

i) By the same token that the fading is too rapid to be tracked, it offers time selectivity. 

ii) Since, in this regime, modem systems distribute the signals over large swaths of 
bandwidth, there tends to be abundant frequency selectivity. 

Within the DMT framework, a fixed outage probability corresponds to = 0, i.e., to the 
full multiplexing gain achievable by the architecture at hand. Thus, as recognized in [42], 
the i?e -maximizing architectures for snr oo are those that can attain the maximum 
multiplexing gain r = min(nT, n^). Due to the nature of the DMT, however, this holds 
asymptotically in the snr. The extent to which it holds for snr values of interest in a selec- 
tive channel can only be determined through a more detailed (non-asymptotic) study. To 
shed light on this point, a case study is presented next. 

D Case Study: A Contemporary MIMO-OFDM System 

Let us consider the exemplary system described in Table [1], which is loosely based on 
the 3GPP LTE design [231. (With only slight modifications, this system could be made to 
conform with 3GPP2 UMB or with IEEE 802.16 WiMAX.) Every feature relevant to the 
discussion at hand is modeled: 

• A basic resource block spans 12 OFDM tones over 1 ms. Since 1 ms corresponds to 14 
OFDM symbols, a resource block consists of 168 symbols. In the high- velocity regime 
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Table 2: TU power delay profile 



Delay (/is) 


Power (dB) 





-4 


0.1 


-3 


0.3 





0.5 


-2.6 


0.8 


-3 


1.1 


-5 


1.3 


-7 


1.7 


-5 


2.3 


-6.5 


3.1 


-8.6 


3.2 


-11 


5 


-10 



being considered, the 12 tones are interspersed uniformly over 10 MHz of bandwidth. 
There are 600 usable tones on that bandwidth, guards excluded, and hence every 50th 
tone is allocated to the user at hand while the rest are available for other users 

• Every coded block spans up to 6 H-ARQ transmission rounds, each corresponding to a 
basic resource block, with successive rounds spaced by 6 ms for a maximum temporal 
span of 31 ms. (This is an acceptable delay for most applications, including Voice- 
over-IP.) The H-ARQ process terminates as soon as decoding is possible. An error is 
declared if decoding is not possible after 6 rounds. 

• The channel exhibits continuous Rayleigh fading with a Clarke-Jakes spectrum and a 
180-Hz maximum Doppler frequency. (This could correspond, for example, to a speed 
of 100 Km/h at 2 GHz.) The power delay profile is given by the 12-ray TU (typical 
urban) channel detailed in Table |2l The r.m.s. delay spread equals 1 fis. 

• The antennas are uncorrelated to underscore the roles of both diversity and multiplex- 
ing. Some comments on antenna correlation are put forth later in the section. 

The impulse response describing each of the rirj^n^ entries of the channel matrix is 

12 

h{t,T) = Y,V^c,{t)6{t~r,) (10) 
i=i 

^For low velocity users, in contrast, the 12 tones in a resource block are contiguous so that their fad- 
ing can be efficiently described and fed back for link adaptation and scheduling purposes as discussed in 
Section im 
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where the delays {r,}]^]^ and the powers {aj}^"^^ are specified in Table |2] and {cj(t)}]^i 
are independent complex Gaussian processes with a Clarke-Jakes spectrum. Although 
time-varying, the channel is suitably constant for the duration of an OFDM symbol such 
that it is meaningful to consider its frequency response as in ([T]). 

The variability of the channel response over the multiple tones and H-ARQ rounds of a 
coded block is illustrated in Fig. [Tl Note the very high degree of frequency selectivity and 
how the channel decorrelates during the 6 ms separating H-ARQ rounds. 

Without H-ARQ, rate and outage are related as per dH). With H-ARQ, on the other hand, 
the length of each coded block becomes variable. With IR (incremental redundancy) 
specifically, mutual information is accumulated over successive H-ARQ rounds [43 J. If 
we let A^fc(sNR) denote the mutual information after k rounds, then the number of rounds 
needed to decode a particular block is the smallest integer K such that 

Mk{snr) > QR,{snr) (11) 

where < 6. A one-bit notification of success /failure is fed back after the receiver at- 
tempts to decode following each H-ARQ round. An outage is declared if A^6(snr) < 
6 Re{sNR) and the effective rate (long-term average transmitted rate) is 

The initial rate is selected such that the outage at H-ARQ termination is precisely e = 1%. 
This corresponds to choosing an initial rate of 6 where R^ corresponds to the quantity 
of interest defined in © with the mutual information averaged over the 168 symbols 
within each H-ARQ round and then summed across the 6 rounds. 

In order to contrast the benefits of transmit diversity and spatial multiplexing, we shall 
evaluate two representative transmission techniques: 

• A transmit diversity strategy that converts the MIMO channel into an effective scalar 
channel with signal-to-noise ratio 

— Tr{H,(fc)Hl(A:)| (13) 

where Hj(A;) denotes the channel for the 2th symbol on the kth H-ARQ round. By 
applying a strong outer code to this effective scalar channel, the mutual information 
after k rounds is, at most 1381 

fc ^ 168 / x 

-^^(SNR) = E El°g 1 + ^ Tr {h,(£)h1(£)} (14) 

e=i i=i V T / 
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(a) 




Figure 1: (a) TU channel fading realization over 600 tones. The circles indicate the loca- 
tions of the 12 tones that map to a given resource block, (b) TU channel fading realization, 
for a given tone, over 30 ms. The circles indicate the locations of the 6 H-ARQ rounds. 
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Transmit diversity strategies provide full diversity order with reduced complexity, but 
their multiplexing gain cannot exceed r = 1, i.e., one information symbol for every 
vector Xj in ([1]). Note that, when = 2, ((T4|) is achieved by Alamouti transmission 



• A basic MMSE-SIC spatial multiplexing strategy where a separate coded signal is 
transmitted from each antenna, all of them at the same rate |[44|J^ The receiver at- 
tempts to decode the signal transmitted from the first antenna. An MMSE filter is 
applied to whiten the interference from the other signals, which means that the first 
signal experiences a signal-to-noise ratio 



during the kth H-ARQ round. If successful, the effect of the first signal is subtracted 
from the received samples and decoding of the second signal is attempted, and so 
forth. No optimistic assumption regarding error propagation is made: an outage is de- 
clared if any of the coded signals cannot support the transmitted rate. The aggregate 
mutual information over the antennas after k H-ARQ rounds is 



where hi^mW is the mth column of Hi(£) and Hi,m(£) = Wrn+i{i)iii,m+2{i) ■ ■ ■ hi_„^(£)]. 
While deficient in terms of diversity order, this strategy yields full multiplexing gain, 
r = min(nT, Wr), when d = 0. This MMSE-SIC structure is representative of the single- 
user MIMO mode in LTE II2311 . 

Let r^T = «H = 4, the high-end configuration for LTE, and consider first a simplistic model 
where the fading is frequency-flat and there is no H-ARQ. Every coded block is there- 
fore subject to a single realization of the Rayleigh fading process. Under such model, 
the spectral efficiencies achievable with 1% outage, 7?.o.oi(snr), are compared in Fig. |2] 
alongside the corresponding efficiency for the non-MIMO reference (riT = 1, = 4). 
Transmit diversity is uniformly superior to spatial multiplexing in the snr range of inter- 
est. In fact, spatial multiplexing results in a loss with respect to non-MIMO transmission 
with the same number of receive antennas. The curves eventually cross, as the DMT 
predicts and the inset in Fig. |2] confirms (the asymptotic slope of spatial multiplexing is 
r = 4 bits/ s/Hz/ (3 dB) while r = 1 for transmit diversity and for non-MIMO), but this 
crossover does not occur until beyond 30 dB. 

^Separate rate control of each coded signal based on instantaneous channel conditions can make this 
strategy optimal in terms of outage [45 1, but this is rnfeasible in this high-velocity regime. 
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Figure 2: Main plot: MMSE-SIC spatial multiplexing v. transmit diversity with tit = 
rift = 4 in a frequency-flat Rayleigh-faded channel with no H-ARQ. Also shown is the 
non-MIMO reference (tit = 1, = 4). Inset: Same curves over a wider snr range. 
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Figure 3: MMSE-SIC spatial multiplexing v. transmit diversity with n^, = = Ain the 
channel described in Tables [T]-(2l Also shown is the non-MIMO reference (n^ = 1, = 4). 

Still with = = 4, consider now the richer model described in Tables [iHIl The 
effective mutual information for each block is averaged over tones and symbols and ac- 
cumulated over H-ARQ rounds. The corresponding comparison is presented in Fig. |3l 
In this case, transmit diversity offers a negligible advantage whereas spatial multiplexing 
provides ample gains with respect to non-MIMO. 

The stark contrast between the behaviors observed under the different models can only be 
explained by the abundant time and frequency selectivity neglected by the simple model 
and actually present in the system. This renders transmit antenna diversity superfluous, 
not only asymptotically but at every snr. Under the frequency-flat model, the signal from 
the first antenna in the spatial multiplexing transmission does not benefit from any spa- 
tial diversity and thus a low rate must be used so that this signal (and the subsequent 
ones) can be decoded with sufficient probability. Under the richer channel model, how- 
ever, the first signal reaps diversity from time /frequency selectivity and thus the lack 
of spatial diversity is essentially inconsequential. This behavioral contrast is, moreover, 
highly robust. Even if the speed is reduced down to where the low- velocity regime might 
start, as in Fig. HI, the behaviors are hardly affected because there is still significant selec- 
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Figure 4: MMSE-SIC spatial multiplexing, transmit diversity and non-MIMO transmis- 
sion as function of velocity for the channel described in Tables [T]-(2] at snr = 20 dB. (Below 
some point, the system transitions to the low-velocity regime and thus the curves are no 
longer meaningful.) 

tivity. Likewise, the performances are largely preserved if the bandwidth is diminished 
significantly below 10 MHz or the delay spread is reduced below 1 /is. 



E Ergodic Modeling 

As it turns out, the time /frequency selectivity in modern systems is so substantial as to 
justify the adoption of an ergodic model altogether. Shown in Fig.|5]is the correspondence 
between the exact rates achievable with 1% outage in the channel described in Tables [l]-|2] 
and the respective ergodic rates. 

From a computational standpoint, this match is welcome news because of the fact that 
convenient closed forms exist for the rates achievable in an ergodic Rayleigh-faded chan- 
nel l|46ll . Moreover, the optimum transmission strategies and the impact upon capacity 
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Figure 5: In solid lines, 1%-outage rate achievable with MMSE-SIC spatial multiplexing 
in the channel described in Tables [T]-(2l In circles, corresponding ergodic rate for the same 
numbers of antennas. 

of more detailed channel features such as antenna correlation. Rice factors, colored out- 
of-cell interference, etc, can then be asserted by virtue of the extensive body of results 
available for the ergodic setting ||47ll48B . 

Antenna correlation, for example, leads to a disparity in the distribution of the spa- 
tial eigenmodes that effectively reduces the spatial multiplexing capability Such effects 
should, of course, be taken under consideration when determining the appropriate trans- 
mission strategy 

F Optimal MIMO Detection 

While in the case study we considered the performance of a low complexity but subop- 
timal detection scheme for spatial multiplexing, the continual increase in computational 
power is now rendering optimal or near-optimal MIMO detection feasible. Rather than 
transmitting separate coded signals from the antennas, a single one can be interleaved 
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over time, frequency and the transmit antennas. At the receiver side, each vector symbol 
is then fed to a detector that derives soft estimates of each coded bit — possibly by use 
of a sphere decoder — to a standard outer decoder (e.g., message-passing decoder), with 
subsequent iterations between the MIMO detector and the decoder [49 J. Such techniques, 
and others such as mutual information lossless codes Ii50i , can approach the mutual in- 
formation in (|3l)|^ 

In the context of our comparison between transmit diversity and spatial multiplexing, it 
is worthwhile to note that the mutual information in © is greater than that of transmit 
diversity for any channel matrix H. Denoting by the £th eigenvalue of HH^, 

logdet(l + ?;^^HHt) = log j'n (l + !^A,) 

SNR , \ 

= log(l + 5??Tr{HHt} 

where (|T6l) holds because the determinant equals the product of the eigenvalues, (|T7|) 
comes from dropping terms in the product, and ((18)) follows from Tr {HH^} = Yl^=i 

Hence, spatial multiplexing with optimal detection is uniformly superior to transmit di- 
versity: there truly is no decision to be made between the two architectures if optimal 
MIMO detection is an option. Drawing parallels with the discussion in Section |I] about 
the suboptimality of repeating the same signal on two frequency channels versus trans- 
mitting different portions of a coded block thereon, one could equate transmit diversity 
with the former and the optimum MIMO strategy with the latter. 

There is another interesting parallel to our earlier discussion regarding the importance of 
channel modeling. In Fig. |6l the spectral efficiencies of Alamouti transmission and spatial 
multiplexing (with optimal detection and MMSE-SIC) are shown for 2, for both 

the frequency-flat model and the richer model in Tables [l]-(2l Optimal spatial multiplexing 
is superior to Alamouti with both models, as per the above derivation, but the difference 
is considerably larger when the rich model is used. Indeed, based upon the frequency-flat 
model one might incorrectly conclude that spatial multiplexing provides only a negligible 
advantage over Alamouti. Note also that MMSE-SIC performs well below Alamouti in 
the frequency-flat setting, but outperforms it in the rich model. 

'it should be emphasized that these approaches perform optimal or near-optimal detection of each 
MIMO vector symbol, but do not attempt optimal detection of the outer code. Hence, complexity increases 
exponentially with the multiplexing order and the constellation cardinality, but not with the blocklength. 
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Figure 6: Spectral efficiencies achievable with Alamouti transmission and with spatial 
multiplexing (optimal and MMSE-SIC) for tit = = 2. The comparisons are shown for 
both a frequency-flat channel without H-ARQ and for the channel described in Tables [l]-|2l 
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Figure 7: Uncoded symbol error probability for transmit diversity and spatial multiplex- 
ing with = = 2. 

V Uncoded Error Probability: A Potentially Misleading Met- 
ric 



Whereas, in the previous section, the superiority of spatial multiplexing relative to trans- 
mit diversity was illustrated in the context of modern wireless systems with powerful 
outer coding, an opposite but incorrect conclusion can be reached if one compares the 
error probabilities of the two schemes in the absence of outer coding. 



Consider 2 for the sake of specificity. Comparisons must be conducted at equal 

SNR and rate, e.g., Alamouti with 16-QAM and spatial multiplexing with 4-QAM. These 
constitute two different space-time modulation formats, both with 4 bits per MIMO sym- 
bol. Fig. IZlpresents the symbol error probabilities, averaged over the fading distribution, 
for a maximum-likelihood detector with no outer coding. The difference in slopes is ex- 
plained by the classical notion of diversity order: for Alamouti the probability of error 
decreases as snr~^, whereas for spatial multiplexing it decreases only as snr^^. Based on 
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these curves, one might conclude that the schemes are roughly equivalently at low and 
moderate snr and that Alamouti is markedly superior at high snr. 

How is the comparison of uncoded error probabilities to be reconciled with the mutual- 
tnformation-based comparison, where spatial multiplexing was found to be decidedly 
better? The answer lies in the outer code. 

In unfaded channels, coding effectively provides a simple horizontal shift of the error 
probability curve. In fading channels, however, the effect of coding is considerably more 
important: not only does it provide such horizontal shift, but it also collects diversity over 
the entire range of symbols spanned by each coded block [|42|. In a system such as the 
one described in Tables [T] and |2l the outer code makes use of frequency selectivity across 
tones and time selectivity across H-ARQ rounds. Without an outer code, on the other 
hand, this selectivity would not be exploited and thus, as the example in Fig [7] attests, 
averaging uncoded error probabilities does not have the same operational meaning of 
averaging mutual informations. Since modem communication systems rely on powerful 
channel codes, inferring their performance on the basis of uncoded error probabilities can 
be a rather misleading proposition. 



VI Conclusion 

Since the 1970's, antenna diversity had been a preferred weapon used by mobile wireless 
systems against the deleterious effect of fading. While narrowband channelizations and 
non-adaptive links were the norm, antenna diversity was highly effective. In modem sys- 
tems, however, this is no longer the case. Link adaptivity and scheduling have rendered 
transmit diversity undesirable for low- velocity users whereas abundant time / frequency 
selectivity has rendered transmit diversity superfluous for high-velocity users. Moreover, 
the prevalence of MIMO has opened the door for a much more effective use of antennas: 
spatial multiplexing. Indeed, the spatial degrees of freedom created by MIMO should 
be regarded as additional 'bandwidth' and, for the same reason that schemes based on 
time/ frequency repetition are not used for they waste bandwidth, transmit diversity tech- 
niques waste 'bandwidth'. 

Of all possible DMT points, therefore, the zero-diversity one stands out in importance. 
Techniques, even suboptimum ones, that can provide full multiplexing are most appeal- 
ing to modem wireless systems whereas techniques that achieve full diversity order but 
fall short on multiplexing gain are least appealing. Our findings further the conclusion 
in ||42|, where a similar point is made solely on the basis of the multiplexing gain for 
frequency-flat channels. Although this conclusion has been reached on the premise that 
the coded error probabilities of discrete constellations are well approximated by the mu- 
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tual information outages of Gaussian codebooks, we expect it to hold in any situation 
where the code operates at a (roughly) constant gap to the mutual information. 

The trend for the foreseeable future is a sustained increase in system bandwidth, which 
is bound to only shore up the above conclusion. LTE, which for our case study was taken 
to use 10 MHz, is already moving towards 20 MHz channelizations. 

At the same time, exceptions to the foregoing conclusion do exist. These include, for ex- 
ample, control channels that convey short messages. Transmit diversity is fitting for these 
channels, which do benefit from a lower error probability but lack significant time /frequency 
selectivity. Other exceptions may be found in applications such as sensor networks or oth- 
ers where the medium access control is non-existent or does not have link adaptation and 
retransmission mechanisms. 

Our study has only required evaluating well-known techniques under realistic models 
and at the appropriate operating points. Indeed, a more general conclusion that can be 
drawn from the discussion in this paper is that, over time, the evolution of wireless sys- 
tems has rendered some of the traditional models and wisdoms obsolete. In particular: 

• Frequency and time selectivity should always be properly modeled. 

• Performance assessments are to be made at the correct operating point, particularly in 
terms of error probability. 

• The assumptions regarding transmit CSI must be consistent with the regime being con- 
sidered. At low velocities, adaptive rate control based on instantaneous CSI should be 
incorporated; at high velocities, only adaptation to average channel conditions should 
be allowed. 

• Coded block error probabilities or mutual information outages, rather than uncoded 
error probabilities, should be used to gauge performance. 

Proper modeling is essential in order to evaluate the behavior of transmission and re- 
ception techniques in contemporary and future wireless systems. As our discussion on 
transmit diversity and spatial multiplexing demonstrates, improper modeling can lead to 
misguided perceptions and fictitious gains. 
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